Multiple query scheduling for distributed semantic caches
نویسندگان
چکیده
In distributed query processing systems, load balancing plays an important role in maximizing system throughput. When queries can leverage cached intermediate results, improving the cache hit ratio becomes as important as load balancing in query scheduling, especially when dealing with computationally expensive queries. The scheduling policies must be designed to take into consideration the dynamic contents of the distributed caching infrastructure. In this paper, we propose and discuss several distributed query scheduling policies that directly consider the available cache contents by employing distributed multidimensional indexing structures and an exponential moving average approach to predicting cache contents. These approaches are shown to produce better query plans and faster query response times than traditional scheduling policies that do not predict dynamic contents in distributed caches. We experimentally demonstrate the utility of the scheduling policies using MQO, which is a distributed, Grid-enabled, multiple query processing middleware system we developed to optimize query processing for data analysis and visualization applications. Preprint submitted to Journal of Parallel and Distributed Computing 27 January 2010
منابع مشابه
Answering Queries by Semantic Caches
There has been growing interest in semantic query caches to aid in query evaluation. Semantic caches are simply the results of previously asked queries, or selected relational information chosen by an evaluation strategy, that have been cached locally. For complex environments such as distributed, heterogeneous databases and data warehousing, the use of semantic caches promises to help optimize...
متن کاملDEMB: Cache-Aware Scheduling for Distributed Query Processing
Leveraging data in distributed caches for large scale query processing applications is becoming more important, given current trends toward building large scalable distributed systems by connecting multiple heterogeneous less powerful machines rather than purchasing expensive homogeneous and very powerful machines. As more servers are added to such clusters, more memory is available for caching...
متن کاملEM-KDE: A locality-aware job scheduling policy with distributed semantic caches
In modern query processing systems, the caching facilities are distributed and scale with the number of servers. To maximize the overall system throughput, the distributed system should balance the query loads among servers and also leverage cached results. In particular, leveraging distributed cached data is becoming more important as many systems are being built by connecting many small heter...
متن کاملSemantic Query Caching for Heterogeneous Databases
Query caching can play a vital role in heterogeneous , multi-database environments. Answers to a query that are available in cache at the local client can be returned to the user quickly, while the rest of the query is evaluated. The use of caches can optimize query evaluation. By caching certain sensitive data locally, caches can be used to answer the parts of queries that involve the sensitiv...
متن کاملAn intelligent query processing for distributed ontologies
0164-1212/$ see front matter 2009 Elsevier Inc. A doi:10.1016/j.jss.2009.06.008 * Corresponding author. Tel.: +82 42 350 3537; fax E-mail addresses: [email protected] (J. L (J.-H. Park), [email protected] (M.-J. Park) (C.-W. Chung), [email protected] (J.-K. Min). In this paper, we propose an intelligent distributed query processing method considering the characteristics of a distrib...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Parallel Distrib. Comput.
دوره 70 شماره
صفحات -
تاریخ انتشار 2010